Federated learning based on stratified sampling and regularization
نویسندگان
چکیده
Abstract Federated learning (FL) is a new distributed framework that different from traditional machine learning: (1) differences in communication, computing, and storage performance among devices (device heterogeneity), (2) data distribution volume (data (3) high communication consumption. Under heterogeneous conditions, the of clients varies greatly, which leads to problem convergence speed training model decreases cannot converge global optimal solution. In this work, an FL algorithm based on stratified sampling regularization (FedSSAR) proposed. FedSSAR, density-based clustering method used divide overall client into clusters, then, some available are proportionally extracted clusters participate realizes unbiased for reduces aggregation weight variance client. At same time, when calculating local loss function, we limit update direction by regular term, so optimized globally direction. We prove FedSSAR theoretically experimentally, demonstrate superiority comparing it with other algorithms public datasets.
منابع مشابه
Stratified Sampling Meets Machine Learning
This paper solves a specialized regression problem to obtain sampling probabilities for records in databases. The goal is to sample a small set of records over which evaluating aggregate queries can be done both efficiently and accurately. We provide a principled and provable solution for this problem; it is parameterless and requires no data insights. Unlike standard regression problems, the l...
متن کاملStratified Sampling Design Based on Data Mining
OBJECTIVES To explore classification rules based on data mining methodologies which are to be used in defining strata in stratified sampling of healthcare providers with improved sampling efficiency. METHODS We performed k-means clustering to group providers with similar characteristics, then, constructed decision trees on cluster labels to generate stratification rules. We assessed the varia...
متن کاملSupport Vector Machine based on Stratified Sampling
Support vector machine is a classification algorithm based on statistical learning theory. It has shown many results with good performances in the data mining fields. But there are some problems in the algorithm. One of the problems is its heavy computing cost. So we have been difficult to use the support vector machine in the dynamic and online systems. To overcome this problem we propose to u...
متن کاملOn stratified randomized response sampling
In this paper, we propose a new quantitative randomized response model based on Mangat and Singh [7] two-stage randomized response model. We derive the estimator of the sensitive variable mean, and show that our method is more efficient than other randomized response models suggested by Greenberg et al. [3] and Gupta et al. [4] estimators.
متن کاملStratified Median Ranked Set Sampling: Optimum and Proportional Allocations
In this paper, for the Stratified Median Ranked Set Sampling (SMRSS), proposed by Ibrahim et al. (2010), we examine the proportional and optimum sample allocations that are two well-known methods for sample allocation in stratified sampling. We show that the variances of the mean estimators of a symmetric population in SMRSS using optimum and proportional allocations to strata are smaller than ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2022
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-022-00895-3